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Abstract 

We prove that critical multitype Galton- Watson trees converge after rescaling 
to the Brownian continuum random tree, under the hypothesis that the offspring 
distribution has finite covariance matrices. Our study relies on an ancestral de- 
composition for marked multitype trees. We then couple the genealogical structure 
with a spatial motion, whose step distribution may depend on the structure of the 
tree in a local way, and show that the resulting discrete spatial trees converge once 
suitably rescaled to the Brownian snake, under some suitable moment assumptions. 
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1 Introduction and main results 

1.1 Motivation 

Multitype Galton- Watson (GW) processes arise as a natural generalization of usual GW 
processes, in which individuals are differentiated by types that determine their offspring 
distribution. They were first studied in 1947 by Kolmogorov and his coauthors. We 
refer to |7J Chapters 2 & 3] for a very nice introduction to these processes. It turns 
out that their analysis is considerably eased under an irreducibility assumption, namely, 
that every type has a positive probability to eventually 'lead' to all others. Under this 
hypothesis, one can use variants of the Perron- Frobenius theorem, which allow to quantify 
the asymptotic behavior of iterates of the mean operator of [7j, and obtain qualitative and 
quantitative results on the GW process. Informally, the large-scale aspects of irreducible 
multitype GW processes are similar to that of monotype GW processes whose mean 
offspring distribution is the Perron eigenvalue of the mean operator. On the other hand, 
Janson [Hj has shown that if the irreducibility assumption fails to hold, many different 
behaviors can occur. 

The aim of the present paper is to investigate the ancestor trees and forests associated 
with irreducible GW processes, when the total number of types is finite. Under criticality 
hypotheses on the mean matrix, and a finiteness hypothesis on the covariance matrices of 
the offspring distributions, we show that the height process of these forests converges to a 
reflected Brownian motion, and hence behaves asymptotically in a similar way as mono- 
type processes (Theorem [IJ. Similar results are proved for family trees conditioned on 
the number of their individuals, under extra exponential moments assumptions (Theorem 
E} . Although we do not focus on a continuum tree formalism here, this says roughly that 
under these hypotheses, multitype GW trees conditioned to have n individuals converge 
once suitably renormalized to the Brownian continuum random tree £[]. 

We also show that these ancestor trees and forests, when coupled with a spatial motion 
whose step distribution may locally depend on the tree structure, converge to the so-called 
Brownian snake ^2j, under mild extra hypotheses on the spatial motion step distribution 
(Theorem QJ). 

Applications of these results to random planar maps are discussed in the companion 
paper [T3j. The approach of the present paper follows quite closely that of [T3] . 

1.2 Multitype Galton- Watson processes 

Let K G N = {1,2,...} be a positive integer. Write [K] = {1,2,..., K}, and identify 
ZM,1 M with Z K ,R K . Suppose given distributions G [K]) on the space of 

integer- valued non-negative sequences of length K. We will often use the notation /x as a 
shorthand for the M^-valued measure G [K]). 

A i^-multitype GW process with offspring distributions /x is a Z+-valued Markov 
process 

Z n = (Z n (i), i G [K]) , n>0 
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such that the law of Z n+1 given Z n = z = (zj, i G [K]) is the same as Y^ie\K] ^1=1 (0: 
where the vectors XW(Z),z G [if],/ > 1 are all independent, and X^(/) has law jiiW for 
all / > 1. 

Otherwise said, this process can be considered as a model for population evolution 
where each individual is given a type in [K], and where each type- 2 individual gives birth 
to a set of individuals with law fi^\ this independently over individuals (although the 
different components of may well be dependent). 

We say that the process (or the measure fx) is non-degenerate if there exists at least 
one i G [K] so that //W({z : ^2 ,■ Zj ^ 1}) > 0. Failure of this last assumption entails that 
all particles a.s. give birth to exactly one particle, and the study of the process boils down 
to that of a Markov chain with values in [K]. All the processes that we consider here are 
assumed to be non-degenerate. 

For i, j G [K], let 

m a = ^ (0 (W) 

be the mean number of type-j offspring of an type-2 individual. We let M M = {pm)i,je[K\ 
and call it the mean matrix of /j,. 

Definition 1 The mean matrix ( or the offspring distribution fx) is called irreducible, if 

for every i,j G [K], there is some n G N so that > 0, where is the ij-entry of 
the matrix M™. 

Notice that the n in the definition may depend on the choice of so that the 
definition is distinct of that of aperiodicity, namely that the above property holds jointly 
for every for some common n. 

With the irreducibility assumption, the Perron-Frobenius Theorem recalled below 
(Proposition EJ) ensures that the eigenvalue q of M M with maximal modulus is real, pos- 
itive, and simple, and that a non-zero eigenvector of M p with eigenvalue g has only 
non-zero entries all having the same sign. 

Proposition 1 (|2J) Suppose that the process (Z n ,7i > 0) is non-degenerate. Then it 
eventually becomes extinct a.s., whatever the starting value, if and only if g < 1. The pro- 
cess (or the distribution /x) is called sub-critical if g < 1, critical if g = 1 and supercritical 
if g > 1. 

It will be useful to introduce the generating functions cp = {^ % \i G [K]) defined by 

^»(8)= ^/i«({z})s Z 

where s = (sj,z G [K]) G [0, 1] K and s z = Hieix] 5 ?- With these notations, we have 
niij = d(pW/dsj(l), where 1 is the vector of [0, 1] K with all components equal to 1. 
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For 1 < i, j, k < K, define 

We say that /x has finite variance if 

Qf k <oo for all i,j,ke[K}. (1) 

Under this assumption, for each i, (Qjl, 1 < j,k < K) is the Hessian matrix of the convex 
function </?W evaluated at 1, hence the matrix of a non- negative quadratic form on WL K , 
which we call Q w (s),s G 1R X . 

Assuming M M irreducible and /x non-degenerate, critical and with finite variance, we 
let a, b be the left and right eigenvectors of M M with eigenvalue 1, chosen so that a - 1 = 1 
and a ■ b = 1, where x ■ y is the scalar product of the vectors 1,1/6 M. K . Let 



K 

^a j g»(b) = A /a _ Q(b) ) (2) 

i=l 

where Q(s) is the iT-dimensional vector (Q^'(s), 1 < i < K). This should be interpreted 
as the 'variance' of the offspring distribution of the multitype process, as it plays a role 
similar to the variance for monotype GW processes in the asymptotics of the survival 
probability, see [T9] . 

Basic assumption (H). In the sequel, unless specified otherwise, we will exclusively 
be concerned with irreducible, non-degenerate, critical offspring ditributions with finite 
variance. Notice that criticality implies finiteness of all coefficients of M M . 

1.3 Multitype trees and forests 

We now add a genealogical structure to the branching processes, by endowing it into a 
tree-valued random variable. For n > let U be the infinite-regular tree 

U = |J rc, 

n>0 

where N n is the set of words with n letters, and by convention N° = {0}. For two 
words u, v, we let uv be their concatenation and \u\, \v\ their length (with the convention 
|0| = 0). If u is a word and iCWwe let uA = {uv : v G A}, and say that it is a prefix 
of v if v G uU, in which case we write u h v. A planar tree is a finite subset t of U such 
that 

• G t, it is called the root of t, 

• for every u ElA and i G N, if ui G t then u G t, and uj G t for every 1 < j < i. 



dsjds k 



A). 
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We let T be the set of planar trees, which we simply refer to as trees in the sequel. For 
a tree t 6 T and net, the number Ct(u) = max{i G Z + : ui G t}, with the convention 
uO = u, is the number of children of u. We say that v is an ancestor of u if v h u. An 
element u G t is called a vertex of t, and the length \u\ of the word w is called the height 
of w in t. The vertices of t with no children are called leaves. For t a planar tree and u a 
vertex of t, we let t u = {v G U : uv G t}, and we call it the fringe subtree rooted at u (it 
is trivially checked that it is indeed a tree). The 'remaining part' [t] u = {u} U (t \ ut u ) 
is called the subtree oft pruned at u. Any planar tree t is endowed with the linear order 
which is the restriction to t of the usual lexicographical order -< on U (u -< v if u h v or 
if u = toii', f = un/ where u[ < v[). We call it the depth-first order. 

In addition to trees, we will consider forests, which are defined as nonempty subsets 
of U of the form 

f = (Jfct( fc ), 

k 

where (t^)) is a finite or infinite sequence of trees, which are called the components of 
f. We let T be the set of forests. The quantity Cf(tt) (number of children of u G f) is 
defined as for trees, and we let Cf (0) G NU {oo} be the number of tree components of 
f. We define f n = {v : uv G f} G T if w G f, and f u = otherwise, so in particular 
fit, 1 < A; < Cf (0) are the tree components of f . Also, let [f] u = {u} U f \ itf„ G T . If u G f 
is a vertex of the forest, we call |w| — 1 the height of u. Notice that the notion of height 
of a vertex relies on whether we are considering the vertex to belong to a tree or a forest, 
the reason being that we want the roots 1, 2, 3, . . . (or floor) of the forest to be at height 
0. There should be no ambiguity according to the context. 

A K-type planar tree, or simply a multitype tree if the number K is clear from the 
context, is a pair (t, e t ) where t G T and et : t — > [K]. We let T^ K > be the set of i^-type 
trees. For u G t, e t (u) is called the type of u. If (t,e t ) G T^ K > and for u G t, we let 
(t,e t ) u G T^' be the pair (t n ,e tu ) where e tu (v) = e t (uv). Similarly, [(t,e t )]„ is the tree 
[t] u marked by the restriction of e t to [t] u . Similar definitions hold in a straightforward 
way for marked and .fT-type forests (f , ef), whose set is denoted by T^ K \ 

In the sequel, we will often denote the marking functions et, ef by e when it is free of 
ambiguity, and will even denote elements of T^,J^ by t or f, i.e. without explicitly 
mentioning e. It will be understood then that t u , f u , ... are marked with the appropriate 
function. We let, for i G [K], 

X {K) = {t G T^) : e(0) = i} , 

and for x = (xj) a finite or infinite sequence with terms in [K], 

?™ = {f G : e(j) = x 3 Vj}. 

For t G T^ K } and i G [K], we let = {u G t : e(u) = i}, and fW is the corresponding 
notation for f G . 

Let = Un>ot-^'] n ^ e ^ ne se ^ °f finite^ possibly empty [A']-valued sequences, and 
consider the natural projection p : Wk ~ > where p(w) = (pj(w),z G [K]) and 
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Pi(w) = #{j : Wj = i} counts the number of elements of w equal to i. Notice that for 
every K-type tree t, any u G t determines a sequence w t (w) = (e(m), 1 < i < ct(u)) G Wk 
with length |w t (M)| = Ct(u). The vector p(w t (u)) counts the number of children of u of 
each type. 

Finally, we will frequently have to count the number of ancestors of some vertex of a 
tree or a forest, that satisfy certain specific properties. For t G and u G t, we let 

Anc^ (P(t>)) be the number of ancestors v \- u that satisfy the property P. For instance, 
Anc^ (e(v) = i, c±{v) = k) counts the number of ancestors of u with type i and k children. 

1.4 Galton- Watson trees 

Let C = (C [K]) be a family of probability measures on the set Wk- We call £ an 

ordered offspring distribution. It is said to be non-degenerate (resp. critical, resp. to have 
finite variance) if the family of measures /x = p*C, := (p*C ,i G [K]) is non-degenerate 
(resp. critical, resp. satisfies (JTJ)), where p*C is the push-forward of Q l > by p. 
For z G [K], we now construct a distribution on %^ such that 

• different vertices have independent offspring, and 

• type-j vertices have a set of children with types given by a sequence w G W# with 
probability C ( w )- 

To do this, let (Wj, = (W*(Z), 1 < / < |WjJ), 1 < i < K, u G U) be a [iiT] x W-indexed 
family of independent random variables such that has law Then recursively, 
construct a subset idlA together with a mark e : t — > [K] by letting G t, e(0) = and 
if u G t, e(it) = j, then wZ G t if and only if 1 < I < |W{|, and then e(ul) = Wl{l). 

It is straightforward to check that t has the properties of a planar tree, except that it 
might be infinite. Moreover, it is straightforward from the construction that the process 

Z„(t) = (# {u G t : |«| = n, e(u) = i} , z G [AT]) , n > , 

is a multitype GW process with offspring distribution /x = p*£, and started from a single 
type-z individual. In particular, under the criticality assumption g = 1, this process 
becomes extinct a.s., so that t is finite a.s. and hence is a tree a.s.. In this case, we let 
Pp, or simply be the law of t on T^ K \ The probability measure is entirely 
characterized by the formulae 

p«(T=t)=nc (e(u)) (w t («)), 

ugt 

where T : T^ K > — > T^ K > is the identity map and t ranges over finite i^-type trees. 

Similarly, if x = (xj, 1 < j '• < r) G Wk, we define P x as the image measure of 
® - =1 P (Xl) by 

(t ( i), . . . ,t( fc )) i — > \J fct(fc), 

l<fc<r 
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i.e., it is the law that makes the identity map F : — ^ the random forest 

whose tree components Fi, 1 < j < r are independent with respective laws p( z «). A 
similar definition holds for an infinite sequence x G [K] . It will be convenient to use the 
notation P r x for pi x u-,x r ) wri en x G [K] n . 

1.5 Convergence of height processes 

For t G T, we let = u t (0) -< u t (l) -<...-< u t (#t — l) be the list of vertices oft in depth- 
first order. When there is no ambiguity on t, we simply denote them by w(0), w(l), . . .. 
Let (if* = \u(n)\,n > 0) be the height process of t, with the convention that = 

for n > #t. If f G we similarly let ttf(0) -< Uf(l) -< . . ., or simply u(0) , u(l) , . . . 
be the depth-first ordered list of its vertices (w(0) = 1), and define (H^,n > 0) by 
Hn = (\ u ( n )\ ~ l)l{n<#f-i} (again, the convention differs because we want the floor 
1, 2, ... of f to be at height 0). Also, for n > and f G let be the first letter of 
u(n) with the convention that for n > #f, it equals the number of components of f. 
For (t, e) G T (x) and i G [K], we let 

A*(n) = #{0 < k < n : e(u(fc)) = z} 

be the number of type-i individuals standing before the n + 1-th individual in depth-first 
order. The quantity Af is defined similarly for (f , e) G !F^ K \ 

Theorem 1 Let £ be an ordered offspring distribution such that n = p*£ satisfies (H). 
Recall the notations a, b, a around (0|). Then 

(i) Under P x , for some arbitrary x G [K] N , the following convergence in distribution holds 
for the Skorokhod topology on the space D(R + ,R) of right- continuous functions with left 
limits: 

(%*,.>o) (Vi,.>o) . 

where B is a standard one- dimensional Brownian motion. 

(ii) For every i G [K], ifi is the constant sequence (i, i, . . .), then, under P 1 , the following 
convergence in distribution in D(]R + ,R) holds jointly with that of (i): 

where (L®,t > 0) is the local time of B at level 0, normalized as the density of the 
occupation measure of B at before time t. 

(iii) Moreover, for any x, 

(^^l,s>o) ^( W >0), ie[K\. 

\ n J n-*oo 

Here, the convergence is convergence in probability under P x ; for the topology of uniform 
convergence over compact subsets o/R + . 
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Note that Theorem ^ could be also stated purely within a tree formalism, without 
reference to height processes. In a rough way, Theorem says that multitype GW forests 
converge once properly rescaled to a random forest, for a certain topology on the set of 
tree-like metric spaces. This limiting forest is made of tree components that are described 
by a Poisson process whose intensity measure is the cr-finite Brownian continuum tree 
measure of [3]. 

Let us comment on this result. First, (i) says that the height process of a multitype 
forest, when properly rescaled, always looks the same as a reflected Brownian motion with 
some prescribed scale factor, whatever the roots are. Moreover, the scale factor depends 
only on fi = p*£, meaning that the exact way in which each set of children is ordered 
does not affect the asymptotic distributional shape of the forest. 

However, the value of x actually matters if one wants to get a closer look at the 
'limiting forest'. Indeed, (ii) says that if one wants to extract every single tree of a forest 
grown from a type-i floor, one should proceed at a certain speed which does depend on 
i. In particular, in a general mixed floor as in (i), when taking a tree component in the 
limiting forest, one is in general unable to recover the rank of the tree it comes from in 
the discrete picture. 

Last, (hi) implies that 

n~ dA i ( [ns\ )1{ S>0 } — > a { dsl {s > } , 1 < % < K , 

n— >oo 

for the topology of vague convergence of measures, hence showing that a provides the 
asymptotic relative weights of different types, which are not influenced by the types of 
the roots. This is known as the convergence of types theorem, see j2], and it is reproven 
by different methods in Proposition H below. Theorem Q (hi) gives the extra information 
that all types are homogeneously distributed in the limiting tree. 

We mention that the topology for the weak convergence of (i) and (ii) could simply 
be the the uniform topology over compact subsets, since all limits are continuous. 

We also obtain as a corollary the following theorem of ^Hj, in the more general case 
of irreducible mean matrix (^H] is in the aperiodic case). 

Corollary 1 Let ht(t) be the maximal height of a vertex in t. Under the same assump- 
tions, as n —>■ oo, we have 

Oh. 

nP w (ht(T) >n) - 



a-Q(b) 



Conditioned versions of Theorem ^ also hold. We say that /x (or £) has small expo- 
nential moments if there exists e > such that 

sup ^ exp^jz^/x^^z) = sup ^ exp(e|w|)(^(w) < oo , (3) 

where |z|i = z\ + . . . + zk- in the following statement, as well as in all statements in the 
paper involving conditioned laws, we make the assumption that n goes to infinity along 
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some subsequence, so that all the conditioning events that are considered have positive 
probabilities. 

Theorem 2 Assume that hypothesis (H) holds and that fi has small exponential moments. 
Then for every i,j G [K], the following convergence in distribution holds on ©([0, 1],M): 



where B cx is the standard Brownian excursion with duration 1. 

Moreover, {{#T)- X Al{[#Tt\) , < t < 1) under pW(-|#T^ = n) converges in prob- 
ability to (akt, < £ < 1) /or £/ie uniform norm as n — > oo, /or ever?/ z, j, fc G [i^]. 

The main ingredient used to prove Theorems ^ and El is the following result: 

Proposition 2 (Corollary 2.5.1 in [4|, Theorem 3.1 in [3J) Theorems^ and\^ are 
true in the case K — 1. 

To be completely accurate, Theorem 2.5.1 in jl] gives the convergence in distribution 
of ^ _1//2 (^ s j, HF ns \, s > 0) to (o~B s , 2{B S — I s )/a,s > 0), where B is a standard Brownian 
motion with infimum process I s = inio< u < s B u , and where V F is the so-called Lukaciewicz 
walk of F (see the proof of Proposition |H1 for the definition), whose only property we need 
at this point is that inf <fc<n V F = — T^. This entails that n _1//2 (Y^ s j, HF,, s > 0) 
converges in distribution to (—crI s ,2(B s — I s )/a,s > 0), which by Levy's theorem has 
same distribution as (crL®, 2\B s \/a,s > 0). 

Also, Duquesne jH] makes the assumption that the offspring distribution is aperiodic 
to avoid conditioning events of zero probability, but the proofs still work by considering 
subsequences as we do. The conditioned version of Proposition |21 was first stated in 
Aldous [l] , using the so-called contour process, rather than the height process, to encode 
the discrete trees. We stress that in [TJ 0] , the authors do not assume that the offspring 
distribution has small exponential moments, and we expect Theorem El to hold without 
this extra hypothesis. As a matter of fact, by making occasional changes in the proofs 
below, one can show that a sixth moment for /x is sufficient, and we suspect that a second 
moment condition is enough. 

We finally stress that Proposition El is in fact a particular case of Duquesne and Le 
Gall's results [HE], which deal with the case of offspring distributions belonging to other 
stable domains of attraction than the Gaussian one. 

The idea of the proof of Theorems ^ El wm be to use an inductive argument on K in 
order to apply the foregoing proposition. We rely strongly on an ancestral decomposition 
for multitype trees adapted from jTTj and to be developed in Section l2~T| in which we also 
state some facts about monotype trees and the Perron-Frobenius theorem. The proof of 
Theorem^ is then given in Sections 12.51 and 12.61 




under 
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1.6 Convergence of multitype snakes to the Brownian snake 

Let us now couple the multitype branching process with a spatial motion. As in [TH] . 
we are interested in the case where the motion, which is parametrized by the vertices of 
the tree, has a step distribution around a given vertex that may depend locally on the 
tree, through the type and children of the vertex. We stress that our method could most 
likely be applied to other kinds of step distributions that depend on the structures of 
neighborhoods of the vertices which are not too large (i.e. have uniformly negligible scale 
compared to the height of the large trees) . 

Consider a family of probability distributions v iV] respectively on Rl w l, and indexed 
by types % G [K] and w G Wk- For (t,e) a multitype tree and every u G t with type 
e(u) = i and children vector w t (u) = w, we take a random variable (Y u j, 1 < j < |w|) 
with law i/j )W , independently over distinct vertices. Let v t be the law of the random vector 
(Y u ,u G t) thus obtained, with the convention that Y = 0. We let 

T (J ° = {(t,e, (y u ,ue t)) : (t,e) G T<*\y G R*} 

be the set of multitype trees with spatial marks on the vertices, and let pW be the 
probability measure 

Similarly, for (f , e) a multitype forest we let z/f be the law of the random variable 
(Y u ,u G f), where Y n = for any n G N and the random vectors (Y ui : 1 < i < Cf(u)) 
for u G f are independent with respective laws v e (u),w t (u)- For x = (xi,x 2 ,...) a finite 
or infinite sequence of types we let P x = dP x ((f, e)) Cg> d^f(y), which is a probability 
distribution on the set 

F m = {(t, e, (y B , u G t)) : (t, e) G T^j G R*}. 

Notice that in the definitions of PW and P x , only the measures z/j jW for (i, w) such 
that C ( w ) > matter. By convention, we let z/j jW be the Dirac mass at G M' w ' for 
all irrelevant indices. We say that the family (z/j jW ,z G [i^],w G WW) is non-degenerate 
if u i w is not a Dirac mass for at least one of the relevant indices (i, w), so that there is 
'some randomness' in the spatial displacement. We say that i> = {y i>Yn i G [-K"],w G Wk) 
is centered if all distributions 

For (t,e,y) G T^, define 

Su' e,y = Vv, 

v\-u 

and let S 1 ^ stand (a little improperly, but for lighter notations) for 5 , *'^ y , with the conven- 
tion that it equals for k > #t. A similar definition holds for S* f where (f, e,y) G ¥^ K \ 
where we use the convention y = 0. In the following statement and in the sequel, | - | a is 
the Euclidean norm of the vector x. 

Theorem 3 Assume n = p*£ satisfies (H) and admits some exponential moments, and 
that v is non- degenerate and centered. Suppose also that every z/j )W admits a moment 
Mj, w = (fj, w , bl^)? f or some £ > 0, such that 

sup M,, w = 0(|w| D ) (4) 

i£[K] 
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for some D > (here \y\2 stands for the Euclidean norm of y). Write 



\ 



J2 a *Yl C W (w)^^.K w ,yJ> G(0,oo) 

ie[K] wGWk j=l 



Then for any x e [K] , under P x , the following convergence in distribution on D( 
holds jointly with that of (i) in Theorem^- 



H-j- 



1 rSt.uS> O) 



n 1 / 4 L J / n->oo \ V cr 



where conditionally on the Brownian motion B of (i) m Theorem ^ R is a Gaussian 
process with covariance 

Cov(R s ,R s i)= inf \B U \. 

sAs'<u<s\/s' 

7/x = (z, z, . . .) for some i G [iT], then the convergence holds jointly with that of (ii) in 
Theorem[l\ 

The analogous statement for conditioned laws is: 

Theorem 4 Under the same hypotheses as Theorem^ the following convergence in dis- 
tribution on D([0, 1],K) holds jointly with that of Theorem^- for every i,j e [K], 



S -^P-,0<t<l) under pW(.|#r^ = n) f E, L^—Rf, < t < 1 
n 1 / 4 / n^oo \ \/ o" A /a7 



j 

where conditionally on the Brownian excursion £> cx of Theorem 0, -R cx is a Gaussian 
process with covariance 

Cov (R?,R7)= inf B?. 

sAs'<u<sVs' 

Remark. By contrast with ^2] and ^H], our hypothesis on the spatial displacement is 
an 8 + ^-moment assumption rather than a 4 + ^-moment assumption. We believe such 
a weaker hypothesis to be sufficient, but were not able to prove it, essentially because we 
could not prove what we believe to be the best Holder norm bounds in Proposition |HJ See 
the remark after the latter's statement. 



Note. In this paper, we will make a frequent use of exponential bounds for real sequences 
(x n , n > 0), namely that \x n \ < exp(— n £ ) for some e > and large enough n. To simplify 
notations and avoid referring to changing e's, we write x n = oe(n) in this case. 
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2 Proof of Theorem U 

2.1 Ancestral decomposition for multitype Galton- Watson trees 
and forests 

Let £ = (C^, • • • , C ) be a non-degenerate critical ordered offspring distribution. For 
i G [K], define the size- biased measure 

C«(w) = ^^C (? )(w), wgW a -, 

and notice that these are probability measures on Wk by the definition of b, since they 
are pushed by p to the measure 

^)(z) = ^-V)(z), zGZ*, 

and they do not charge the null sequence 0. On some probability space (Q,A,P), let 
((WJj, W^, l l u , i G 6 U) be a family of W-indexed independent random vectors, 

such that has law £W, has law £W, and 

P& = k\wi)= ^ • 

p(Wi) ■ b 

Otherwise said, is equal to fc with probability proportional to b Wk given = w. 
Fix some % G [i<T]. Recursively, we build a set t, a mark e : t — > [A'] and a sequence 
(Ki; ^ > 0) by first letting Vo = G t, e(0) = z, and given Vo, e(V ), . . . , V n , e{V n ) have 
been constructed with e(V n ) = j, we let V n+ i = V n ly and e(V n+ i) = Wy (l v ). Then, for 
u G t with e(u) = j, 

• if it = V n for some n, then ul G t if and only if 1 < / < |W£|. For such I, 
e(ul) = W>{1), and 

• otherwise, ul G t if and only if 1 < I < |Wj|. For such /, = W^'(Z). 

The set t thus obtained has the properties of a tree, except that it is infinite. More 
precisely, it consists of an infinite 'spine' Vo,Vi, ... of distinguished vertices, which is 
interpreted as an infinite ancestral line along which individuals of type % have a Q^- 
distributed offspring sequence, among which each element j is selected as a distinguished 
successor with probability proportional to bj. Then, non-distinguished individuals have a 
regular GW descendence with offspring distribution £. It is easy to prove that the spine 
is the unique infinite simple path of vertices in t starting from 0. 

We let pW ,h be the law of ([t]y^, V h ). It is a distribution on the set {(t, e, v) : (t, e) G 
T^ K \ v G t} of pointed trees, on which we let (T, V) be the identity map. For any finite 
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sequence x of types, let also P^' h be the law under which (Fi,l ^ are pairwise 

independent with respective laws (P( Xl \ I ^ j), P^ x ^' h . 

Notice that by construction, the types of the distinguished individuals V ,Vi, . . . form 
a Markov chain, whose transition law is 



zezi 



-m jf . 



(5) 



The stationary distribution of this Markov chain is easily checked to be the vector 
{a>ibi, 1 < i < K), because of the normalization a • b = 1. This will be useful in the 
sequel. 

Lemma 1 For any finite sequence of types x = (xj, 1 < j < r), and any non-negative 
functions Gi,G2, 



E y 



.veF 



j=l h>0 



G 1 (V,F)E^ V » [G 2 (T)} 



Mv) 



(6) 



Proof. Let (f , ef) be a .fT-type forest, u a leaf of f (i.e. a vertex with no child) and 
(t, e t ) a if-type tree with e t (0) = ef(u). Then it is enough to show the result for 
Gi(u, f)G 2 (t) = l{ w ,f,t}) as one can then use linearity of the expectation. In this case, 
the left-hand side of (JHJ) is equal to P X (F = [f, u, t]), where [f, u, t] is the only forest 
(f, e') G T^ K > containing u with [f'] u = f and i' u = t. We let S = {v : v h it, v ^ u} and 
A — {y : v S U {it}, -ru G £}, where -ii> is the father of v, i.e. the word v with its last 
letter removed. We can redisplay 

p x (f = [f ,md =h& {v) \™av)), 



vet' 



as 



nc (etM) (w t ^)) n c (efW) (w f ^))nc (ef(i,)) (wf(t;)). 

In this expression, one can factorize out the product of probabilities of the subtrees f' u — t 
and f - = fj for j G {1, . . . ,r} \ {u±}, u\ the first letter of u, and £j for v & A. Letting 
j = u\ and u = ju', this shows 

P x (F=[f,M]) 

= p(«f(«))(r = t) p( 3i, )(r = f,) Y[p Mv) \t = Q Jjc (ef(u)) (wf(u)). 

We can also rewrite the last product as 



Wf(f)) 



p(w f (u)) • b 
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and we finally recognize 
P x (F=[f>,t]) 

= J^pfe-),|«|-i(( T) = ( f . 5 u '))p{^)){T = t) ( JJ PW(T = f,) ] 

= 6 a ,^>l[l {M , f} (y,F)E( e W)[l {t} (T)]/6 eW ], 
which yields the result. □ 

2.2 Around Perron-Frobenius' Theorem 

Let us first recall the well-known Perron-Frobenius Theorem, which in this form can be 
found in [T7j . 

Proposition 3 (Perron-Frobenius) Let M e 97t#(]R + ) be an irreducible matrix. 

(i) The matrix M has a real eigenvalue g with maximal modulus, which is positive, simple 
(i.e. it is a simple root of the characteristic polynomial of M), and every g-eigenvector 
has only non-zero entries, all of the same sign. 

(ii) Any eigenvector of M with non-negative entries is a g-eigenvector, and hence has only 
positive entries. 

From this, we deduce the following useful 



Lemma 2 (i) Suppose M e 3Jl^(M + ) is irreducible. Then its spectral radius g satisfies 
g > ma, for all 1 < i < K. 

(ii) Suppose K > 1 and g = 1. Then the matrix M G 0Jt^_i(IR + ) with entries 

niij = rriij + , 1 < i,j < K - 1 

1 - m KK 

is also irreducible with spectral radius 1. 

Proof, (i) is immediate by writing Ylf=i m ij u j = Q u iA < i < K, where u is a right 
^-eigenvector of M with positive entries. 

(ii) First, notice that the irreducibility of a matrix with nonnegative entries only 
depends on which entries are non-zero, and not on what their actual value is. Since 
1 — rriKK > by (i), it is sufficient to show that the matrix with entries 

max(mjj, rriiKniKj) , 1 < i, j < K — 1 (7) 

is irreducible when M is. Assuming M irreducible and upon replacing m^- by l{ mij >o}, 
we may assume M is the adjacency matrix of a connected non-oriented graph on the 
vertices 1, . . . ,K. But now, the matrix of (jJJ) is the adjacency matrix of the graph on 
the vertices 1, . . . , K — 1, where % is adjacent to j if and only if they are either adjacent 
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or both adjacent to K in the initial graph. It is straightforward to see that this graph 
remains connected, so its adjacency matrix is irreducible. 

It remains to show that M has spectral radius 1. First, it is immediate that if a, b 
are left and right 1-eigenvectors of M, then (ai, . . . , clk-i) and (&i, . . . , bx-i) are left and 
right 1-eigenvectors of M. Indeed, for 1 < j ' < K — 1, 



K~l K-l 

m Kj 



dirriij = cij — aK^riKj + — vtikk)- — - — = Uj, and bj 

1 — m K K , 

i=l 3=1 

for 1 < % < K — 1. This is enough to conclude by (ii) in Proposition El □ 



2.3 Reduction of trees 

2.3.1 A projection on monotype trees 

We describe a projection function IT® that goes from the set of K-tjpe planar forests to 
the set of monotype planar forests, and which intuitively squeezes generations, keeping 
only the type-2 individuals. 

Precisely, if (f , e) is a if- type forest, we first let v i -< v % -< . . . be the ordered list of 
vertices of f W such that all ancestors of Vk have type different from i. We consider a forest 
nW(f) = f with as many tree components as there are elements in {v±, V2, ■ ■ ■}■ Thus, we 
start with the set of roots 1, 2, ... of f. Then recursively, for each u G f , let v u ±, . . . , v uk 
be the vertices of (v u f Vu ) \ {v u } such that 

• e(v uj ) = i, 1 < j < k, 

• for every 1 < j < k, if v U j = v u l\, . . . then e{uli . . . l r ) ^ i for all 1 < r < h, and 

• v u i, . . . , v u k are arranged in lexicographical order. 

Then, we add the vertices ul, . . . , uk to f , and continue iteratively. If u £ f ' has children 
ul, . . . , uk, we let 

# W («) = #^\ (jJ f ^JJ 

be the number of vertices that have been deleted between u and its children during the 
operation. We also let 

^floo» = e f n = e{w) ^iVwhv}, 

be the number of vertices of the n-th tree component of f that lie below the first layer of 
type-z vertices. 

If (t, e) G T^ K \ we may apply the map 11® to the forest It, and get as a result a forest 
f. We denote this forest by 11® (t) with a slight abuse of notation. Notice that IT®(t) 
has one tree component if e(0) = i, in which particular case we denote it by f( = IT®(t), 
with further abuse of notations. 
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If [i is an unordered offspring distribution and i,j G [K], we let jj/- be the distribution 

of 

Cn(i)(T)( ) under P^ with the above conventions: if i ^ j then this counts the number 
of components of the reduced forest, while if i — j this is the number of children of the 
root of the reduced tree. 

Proposition 4 (i) Let x G [K] N . Under the law P x , the forest Uy> (F) is a (monotype) 
GW forest with offspring distribution Jl[ ' , whose mean and variance are equal to 

J>/f({z}) = l and Var(p«) = ^i 

2>0 1 % 

In particular, it is critical with finite variance. 

(ii) For each % and still under E x , the random variables (N^(u(n)),n > 0), (Aflo 0r ( n )' n — 
1) are all independent, and the variables (N^ (u(n)),n > 0) are i.i.d. The number of 
vertices which are deleted in the operation IT^ between two generations has mean 

E* [JVW(«(0))] = i-l, 

and finite variance. Similarly, the random variables (N^ OT (n),n > 1) have finite (x n - 
dependent) variance, as well as the lawsji-p fori,j e [K\. 

(iii) More generally, if \i admits a finite p-th moment with p G N (resp. admits some 
exponential moments), then so do Jif^ and the variables (u(n)) , N^ OT (n + l),n > 
under P x . 

The GW property of IL®(F) under E* is easy to obtain from Jagers' theorem on 
stopping lines [H], each subtree rooted at a vertex of type i being a copy of the whole tree. 
This also gives the independence statement in (ii). A detailed proof of these intuitive 
statements would be cumbersome, so that we leave the details to the interested reader, 
whom we refer to jH]. 

The rest of the proof of this proposition will be done by removing types one by one, 
and using an induction argument. 



2.3.2 From K to K - 1 types 

In this section, we will suppose that type K is deleted, keeping in mind that the general 
case is similar. If f G F {K \ we now let V\ -< v-i -< . . . be the ordered list of vertices of f 
such that e(vj) ^ K and e(v) = K for every v h Vi. Recursively, given v u G f has been 
constructed, we let v u \ -<...-< v u k be the descendents of v u such that e(v) = K for every 
v u h v h v U j, v {v u , v U j}, while e(v U j) ^ K. We then let 11(f) be the set of u G U such 
that v u has been defined by our recursive construction, and naturally associate a type 
e(u) = e(v u ) with 11(f). If u G 11(f), we let N(u) be the number of vertices of type K 
that have been deleted in this construction between v u and v u i, . . . , v u k, namely, 

= # K \ y fj) -i, 
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with the above notations. We also let Nq oot (ji) be the number of vertices v G that 
have only ancestors of type K. 

Lemma 3 Let x G [K}^ . Then, under P x : 

(i) for any 1 < i < K — 1, the forest n(F) is a non- degenerate, irreducible, critical 
K — 1-type GW forest. The (unordered) offspring distribution Ji = G [K — 1]) has 
generating functions 

#(s) = <pW(s,&*)(B)), (8) 
for 1 < % < K — 1 and s G [0, 1] , where ft*) IS implicitly defined by 

&V(a) = <pW(s t (9) 

(ii) the K — 1 sequences (N(u^ l '(n)),n > 0),i G [If — 1] are independent and formed 
of i.i.d. elements, where u®(0) -< u^(l) -< ... is the ordered list of elements of F with 
type i. Their generating functions ip^ respectively satisfy 

^%) = ^«(1,...,1,^( S )), (10) 

where ip( K ^ is implicitly defined by 

^ K \s)=s^ K \l,...,l,^ K \s)). (11) 

The random variables Nft OOI (n),n > 1 are independent as well. 

(hi) any integer or exponential moment conditions on fj, is also satisfied by the laws fl 
and the random variables N(u^(n)), Na oor (n + l),n > 0,i G \K — 1], 

Proof, (i) Again, the GW property follows from the construction. On some probability 
space, let X^ have same distribution as the 1 -valued random vector of children of 
a type-j vertex of II (F) under P x . Then, by separating the offspring of this vertex with 
types equal and different from if, we obtain the identity in law 



xw^(xP,...,x^ 1 ) + £xW(/) 



1=1 



where has distribution //W anc i i s independent of (X^(i), I > 1), which are indepen- 
dent with same distribution, and a vector X^ x ^ with this distribution must satisfy 



xw^(x 1 ^,...,xf_ ) 1 ) + £xw(0 



i=i 



with similar notations. These two expressions immediately translate as (jHJ) and (JUJ). Now 
let 

dip® 
dsj 



m ij = — (1) > 1 - h3 < K - 1, 
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so that M = (nT'ij)i<i,j<K is the mean matrix associated with the K — 1-type GW forest 
n(F) under P x . Differentiating (jHJ) and (jHJ) and letting s increase to (1, ... , 1) gives 

_ _ m iK m Kj 
1 — m KK 

for 1 < i, j < K — 1. Hence, M is defined as in Lemma so it is irreducible with spectral 
radius 1. Moreover, the if-type GW process associated with 11(F) under P x has to be 
non-degenerate, because it dies in finite time a.s.. 

(ii) The independence statement is again a consequence of Jager's theorem, and For- 
mulas (jlOj) and (fTTj) are obtained by similar distributional equations arguments as above, 
namely, 

x (i) 

V(»)) = E?. 
i=i 

where has law /j,® and iVj ,1 > 1 are i.i.d. random elements independent of X^, 
that satisfy 



(K) 



i=i 



On the other hand, N^ 0OI (n) is either or equal in distribution to n[ k ^ + 1 with the same 
notation, according to whether x n ^ K or x n = K. 

(iii) is obtained by differentiating equations (jHJ), ©? (HH an d (fTTj) p times, while the 
assertion on small exponential moments is obtained by applying the implicit function 
theorem to the implicit functions $ K \ Details are left as an exercise to the reader. 
□ 

Notice that this provides an alternative way of showing that the spectral radius of M 
is < 1, since the GW process has to be (sub)-critical in order to become extinct a.s. Recall 
that a, b are the left and right 1-eigenvectors of M with a -\ = 1 = a ■ b. In view of the 
proof of Lemma El the left and right 1-eigenvectors a, b of M satisfying a - 1 1 a - b 
are given by 

'ai, . . . , a*r_i) , b = — ^ — (bi, . . . , b K ^x). 



l-a K 1 - a K b K 

We are now ready to give the 

Proof of Proposition^ (i) We prove this by induction on K, in the case % — 1, without 
losing generality. The case K — 1 is obvious, since in this case M — (1), = (a 2 ) 
is indeed the variance of the offspring distribution and is the identity. According to 
Lemma El under P x , it is licit to do the K-to K — 1-type operation II, without changing 
the hypothesis that the GW processes under consideration are nondegenerate, irreducible 
and critical. This immediately gives the result on the mean of the offspring distribution 
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of n^)(F) by induction. The only statement that remains to be proved is the formula for 
the variance of its offspring distribution. Using again (JHJ) and (JHJ), straightforward (but 
tedious) computations show that, letting 

be the quadratic forms associated with the offspring distributions of 11(F) under P x , 

n(i) n (i) , m KkQ ( jK + m KjQkK rn K jm K k n {i) 
Q > k - l-m KK + {l-m KK )^ KK 

mix ( n (K) m KkQ ( ^K + m KjQ k K K m Kj m K k n (K) 
+ l-m KK {^ k+ l-m KK + (l-m KK y QKK 

It is then easy to check that 

a-Q(b) = l ~ a l . 2 a-Q(b). 

(1 - a K b K y 

Using the induction hypothesis, we obtain a • Q(b) = ai6i"Var so that 

axtf Vax (rf>) = " Q(b), 

(1 - o x 6 A ) 2 (1 - a^o A -) 2 

giving the result. 

(ii) We again prove this in the case i — 1. When iiT = 1 there is nothing to prove. 
If K = 2, one checks that the number of type-2 vertices trapped between two 1-type 
generations of U^(T) under has mean 77112/(1 —^22) and finite variance (resp. some 
exponential moment if has some), by differentiating ()10|) and (fTTj) once. Then, one 
obtains by direct computations that this is 02/01 = (1 — a\)ja\. 

So suppose K > 3. The idea is to apply the projection operation II, K — 2 times, 
removing types K, K — 1, ... ,3 one after the other. When this is performed, a two-type 
tree U ok (F) is obtained, and the number of type 2 vertices that have only the root as 
type 1 ancestor is precisely the number of type 2 individuals that are trapped between 
two generations of U^(F). By a direct inductive argument using Lemma El and the 
discussion after its proof, the mean matrix of this contracted two-type tree Il ofc (F) has 
(01,02) as left 1-eigenvector. In view of the K = 2 case above, the mean number of 
deleted type 2 vertices in a generation is thus 02/01. By symmetry, the average number 
of type i E {2, . . . , K} vertices deleted in a generation of U^(F) is Oj/ai. The average 
total number of deleted vertices is thus (a 2 + . . . + ok)/°i = (1 ~ a i)/ a i, as claimed. 

Finally, (iii) is obtained by applying point (iii) in Lemma El in a similar induction 
argument. □ 
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2.4 Two exponential bounds 

Let £ be an ordered offspring distribution with = fi satisfying (H). The following 
lemma allows to control the height and number of components in a GW forest. 

Lemma 4 There exist two constants < C, C < oo depending only on fi, such that for 
every n G N, x G [K] N and r\ > 0, 

P x ( max \u(n)\ > n 1/2+2 A < Cexpf-C'n"), 

\0<k<n J y 

and 

P x (T F n > n l ' 2+2 ^) < Cexp(-C"n' ? ), 

Proof. In the case K — 1, this is a straightforward consequence of Lemma 13]. In 
the general case, it suffices to note that under P x , independently of x, 

max \u F (k)\ < > max IwnCOrp^WI , 

0<fc<n ^— ' 0<fc<n ( ' 

ie[K] 

and 

n — / j n 

ie[i<] 

We may then apply the K = 1 case to each of the forests U^(F),i G [iC], which under 
P x are critical non-degenerate monotype GW forests by Lemma |3J □ 



2.5 Convergence of types 

A natural way to proceed to prove Theorem Q is now to use the known results of the 
monotype forest LTW(P), and to try and pull them back to the projected multitype tree. 
To do this, we must take care of two kinds of loss of information: the number of vertices 
with type 7^ 1 of F that stand between two consecutive 1-type vertices seen in H^(F) 
('time' information), and the number of vertices of F that actually stand between a type-1 
vertex of II^^P) and one of its sons ('height' information). 

For u G f G !F^ K \ let AnCf (i) := Ancj (e(v) = i) be the number of ancestors u such 
that e(v) = i. 

Proposition 5 Under (H), for every 7 > and x G [K] N , 



maxP* 

id[K] 



max 

0<fe<n 



Hi 



Anc 



u(k) 



a A 



> n 



1/4+7 



oen 



'121 



For this, we need the following moderate deviations estimate for Markov chains. 
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Lemma 5 On some probability space (Q,A,P), let (X n ,n > 0) be an irreducible Markov 
chain taking values in a finite set S. Let tx be its stationary distribution, and £ n = 
n~ l Ymc=o $x k be its empirical distribution at time n. Then, for any f : S —* R and 7 > 0, 
there exists N(f,j) > for every n > N(f,j), 

max P(k\Uf) - tt(/)| > n 1 ' 2 ^) < exp(-n 7 ) . 

0<fc<n 

Proof. If the Markov chain is also aperiodic, then according to Wu j20J Theorem 2.1 (a)], 
for every / there exists a constant C depending on / such that for every large enough k, 

P(k\Uf) - *(/)| > ^ 1/2+7 ) < exp(-C^) . 

If the chain has period d > 1, then the same result is easily obtained by partitioning the 
state space into the d periodic classes with equal 7r-masses and considering the d shifted 
chains (X n d+ r , n > 0) for < r < d — 1, which under P(-\Xq = x) are aperiodic for every 
x e S. 

Next, notice that for large enough n, max 0<Kn i/2+ 1 /2 P(k\£,k(f) — 7r (/)| > n 1 ^ 2+7 ) = 
since — 7r(/)| < 2 1| / 1| oo. Thus, we have, for large enough n, 

max P(k\Uf) ~ tt(/)| > n 1 ^) < max P(k\Uf) ~ *r(/)| > k 1 ' 2 ^) 

0<k<n n l/2+ 7 /2< fc < n 

< exp(-Cn 7+72 ) , 

entailing the result. □ 

Proof of Proposition O Suppose i = 1 with no loss of generality. By Lemma IH the 
probability that either > n x l 2+1 or max <fc< n > n l l 2+1 is an oe(n), so we can 

restrict ourselves to the complementary event. Thus, it suffices to bound the quantity 

PX (^Tf <ni/H7 imM0 , t ,„ | uW |< n i/H,} - Anc; W (l)| > n 1 '^ 

1/4+7 



< -P^i/2+7 1 max |ai&i|«| - Anc£(l)| > n 

1 J \weF,|u|<nV2+7 

for large n. By bounding the max by a sum over the same set, and then making use of 
the ancestral decomposition (Lemma HJ), this is less than 



^ l{|u|<nV2+7}lj 



|<n 1 /2+7}Jl|| aibl | u |_ Anc u( 1) | >n i/4+ 7 j 

.ueF 

< C ^ (Xj) ' h M^Anc^l) -0161I > n 1/4+7 ) 

3=1 h=0 

< Cn 1+27 max max P {i) ' h (MhT 1 Anc£(l) - aA| > n 1/4+7 ) 

ie[X] 0</i<nV2+7 v y 



2 PROOF OF THEOREM^ 



22 



where C = max; bij miiij b{ > 0. Recall that under pW'' 1 , the sequence (e(V ), . . . , e(V/J = 
e(V)) is a Markov chain in [iT] started at i with step transition pjj> = rrijj'bji/bj, and 
which admits ab := (a±bx, . . . , a^x) as invariant probability. Notice that ft, _1 AnCy (1) is 
the empirical measure of {1} for this Markov chain. The result is now a straightforward 
consequence of Lemma □ 

Next, recall the notation Af(fc) and let also G\(k) = Card {u -< uW(fc)}, where 
u^(0), ... is the list of type-i vertices of f , arranged in depth-first order. A similar 

notation holds for trees instead of forests, and we adopt the convention Cr*(#tW) = ^t. 

Proposition 6 (i) For any x e [K] N , under P x ; as n — > oo ; (Af(\ns\)/n,s > 0) con- 
verges in probability to s ^ OiS, for the topology of uniform convergence over compact 
sets. 

(ii) Moreover, if fi admits small exponential moments, it holds that for every 7 > 
and x G [A^] N ; 

P x (\G[(n) - aj l n\ > n l/2+ ^) = oe(n) . (13) 

Proof. With the notations of Section E3H for f e F {K \ let let N(k) := N®{u{k)) be 
the number of descendents v of u^\k) such that the types of vertices in {w : u^ l '(k) h 
w h v,w 7^ are all 7^ z, and N'(k) is the similar quantity, but counting only the 

vertices w that come before u^\n) in depth-first order. Then 

n— 1 n—l 

Gj(n) = £(1 + (*)) + E(A'(fc) - JV(fc))l {uW(Jfc)httW(B)} + N aL(k) , (14) 

k=0 k=0 k=l 



V— 

:=iJi(n) :=R 2 (n) 

Now, we estimate the probability that R\{n) + R2(n) is large, and by Lemma EJ for any 
fixed < 5 < 1/2, we may restrict ourselves to the event that the number of ancestors 
of type 1 of u^(n) is < n}l 2Jr8 and that the tree containing has rank < n l l 2+s , up 

to losing an oe(n) term. Then under this event, the probability of {i? 2 (n) > n 1 ' 6 } is less 
than 

' ,1/2+5 



iv floorW ^ u 

k=l 



Now under P x , the A^ or (/c)'s are independent, with respective laws that of Card{t> : i £ 
{e(w) : w h w}} under P™. By (ii) in Proposition 0] and Chebychev's inequality, this 
goes to as n — > 00. 

On the other hand, notice that |Pi(n)| < ^22=o ^ r (^)l{«(^(fc)hn( i )(n)}- Let r n = sup{/c : 
P x (A(0) > k) > n~ 1 }. Since A(0) has finite variance under P x by (ii) in Proposition 
HI it holds that P x (A(0) > t) = o(t~ 2 ) as t — > 00, so that r n = o(n 1 / 2 ), as otherwise 
r <^(n) > c^in) 1 ! 2 for some extraction 0, so that 

< P x (A(0) > r Hn) ) < P x (A(0) > ^(n) 1 / 2 ) = o(0(n)- 1 ) , 
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a contradiction. Therefore, for any r' n = o{n 1 / 2 ) such that r n = o(r' n ), we obtain that 

P x ( max N(k) > r' n ) < 1 - (1 - P(N(0) > r' n )) n — . 

On the other hand, we know from (i) in Proposition 0] and Proposition El that 

rT l l 2 max Anc^fi) < n _1//2 max ^ , 

0<fc<n 0<fc<n 

which converges in distribution as n — > oo to the supremum of a properly scaled Brownian 
excursion. Consequently, noticing that R\(n) is a sum involving Anc^ (i) terms, we 
obtain that for every e > 0, there exists C > such that 

P x (n^R^n) < Cn^n 112 ^) >l-e 

for every n large, whence R\{n) = o(n) in probability. 

These estimates, when combined with (I14jl and the law of large numbers, entail that 
Gf(n)/n converges in probability to the mean of 1 + N(0) under Pw, which by (ii) in 
Proposition |U is 1/aj. Therefore, Gf ( |_ns J ) /n — * a^ 1 ^ in probability for every rational s 
and we claim that the convergence holds for the uniform topology over compact subsets 
of BL To see this, one can use Skorokhod's representation theorem and assume that the 
convergence of Gf(\ns\) is almost-sure for every rational s, and then apply a standard 
monotonicity, continuity and compactness argument. It is then elementary to conclude 
that the right- continuous inverse function (Af ( \ns\ )/n,s > 0) converges in probability 
to s ^ a>iS for the uniform topology over compact sets. 

Part (ii) of the statement is obtained along closely related lines, by first noting that 
this time, for < 5 < 7/2, and using similar notations as above, 

( n l/2+S \ 
iV floor(^) > nl/2+2$ +° e H 

< exp(An l/2+s - en 1/2+2S ) + oe(n) = oe(n) 
for some A,e > 0. Then, we have P x (max <fc<„ N(k) > n" 5 ) = oe(n), so that 

P x (Pi(n) > n 1/2+2S ) < P x ( max |«(Jfc)| > n 1/2+s J + P x ( max A(A;) > n 

yo<fc<n y yo<fc<n 



oe n 



Finally, the estimate 



n-l 



k=0 



> n 1/2+ ^ ) = oe(n) 



is a standard moderate deviations estimate for random variables admitting small expo- 
nential moments, see f5J Theorem 2.6]. This is enough to conclude. □ 

Notice that the previous statement immediately implies point (iii) in the statement of 
Theorem ^ 
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2.6 Proof of (i) and (ii) in Theorem [T] 

For any s > 0, we have 



o-n«(F) 

Af (|ras|)-l 



< 



/i i \ Anc j? 
u{\ns\) — 



u{\ns\) 



,w(LnsJ)/.^ 



a A 



By Proposition El we obtain that, for every A > 0, 



sup n 

0<s<A 



-1/2 



M (L ns J) 



Anc 



u([nsj) . 



a A; 



in probability as n — > oo. On the other hand, we claim that 



^Af(M)-l 



Anc^ {LnsJ) (z^ 



< 



ff nW(F) _ ff n«(f) 

^Af(M)-l ^Af(Ln S j) 



+ 1. 



(15) 



Indeed, if t{W(|_nsJ) is an ancestor of w(|jisj), then the left-hand side is zero and there 
is nothing to prove. Else, the left-hand side equals the number of ancestors of type % of 



u^(\ns\) which are not ancestors of w(|jisj), and so H 



n«(F) 

Af(M)-l 



Anc u F ilnsl \i) > 0. On 



the other hand, the strict ancestors of u^\\ns\ + 1) that are not ancestors of type i of 
u([ns\), cannot be themselves of type i by definition (otherwise, such an ancestor would 

come after m(|_^ s J) and before u^\[ns\ + 1) in depth-first order). Hence, 



Anc 



u(|rasj) 



Af(M) 



i) < 1, so that 



< H 



nW(F) 

Af(M)-l 



— Anc 



u([ns\) 



i)<H 



n«(F) 

Af(Ln*J)-l 



o-n«(F) . 

Af ([ns\) + 1 



and the claimed inequality follows. 

Under P x , the forest IlW(F) is a single-type GW forest whose offspring distribution 
has finite variance by Proposition HJ so that by Proposition 



n 



-1/2 



max |-« fe _ 1 - H k 



(F)\ 







0<fc<n 



and it follows that under P x , 

^(^j-^Al-^'S^^O 



(16) 



for the topology of uniform convergence over compact sets. 



Using Propositions El ID and El and composing s i— > H^\^ with s i— > Af (|_nsj)/n, we 

now obtain that (^"^^^([ni])' s > 0) converges in distribution to (2a^ 1 \B aiS \, s > 0) 

where of = Var(7Z^ ), which is also equal in law to (2a] 'a~ 1 \B s \, s > 0). One way of 
seeing this is to use Skorokhod's representation theorem to exhibit a probability space 
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where the convergences of Propositions|2]and|n]hold a.s. rather than in distribution. Point 
(i) of the theorem is now proved by using |T6|) . 

Let us prove (ii). By definition, since all the roots are of type i, u([ns\) and the last 
node with type i before w(L^sJ) in depth-first order belong to the same tree. Therefore, 
the label of the tree of F containing w(|_nsj) is always the same as the label of the tree of 

nW(F) containing the Af(|_nsJ)-th node. This implies that T^fi^.j = Y(^ s j- Now, the 
result is a plain consequence of Proposition El and of similar arguments as above. □ 

Proof of Corollary From (ii) in Theorem we obtain that (n~ l H^ 2s ,0 < s < r n ) 
converges in distribution to 2a~ 1 (|£> s |, < s < X& i0 -i), where r n is the first hitting time 
of n by T F and % x is the first hitting time of x by LP. 
Now, 

P*(V1 < k < n,H(F k ) <n) — ► P(2a- 1 \B S \ < 1,0 < s < %. a -i) , 

n— >oo 

which can be rewritten as 

(l -P (i) (ht(T) > n)) n — ► exp ( --N ( -supe > 1 

where A^(de) is the Ito excursion measure of the standard Brownian motion (see e.g. ^01 
Chapter XII] for definitions and the results recalled below), and where we have used the 
Ito decomposition of a Brownian motion into a Poisson process of excursions in the local 
time scale. Taking logarithms and using iV(supe > x) = l/x, gives the result. □ 
Let us also mention that similar arguments, following the same lines as in jU Propo- 
sition 2.5.2], actually show the more general result: 

Corollary 2 For every a > the probability measures P^iji^H 1 ^ E ■ |ht(T) > an) 
converge in distribution as n — > oo towards N{2o~~ l e G ■ |2o"^ 1 supe > a). 

2.7 Conditioned results: Theorem [2] 

Our main tool for conditioning is the following estimate for the size of GW trees. 

Lemma 6 Let \i be a critical non- degenerate offspring distribution with finite variance. 
Then for every i,j G [K], one has 

n V*pW(#T® =n) — ► dj, 

where if necessary the limit is taken along a subsequence for which the probability on the 
left-hand side is non-zero, and for some constant > 0. 

Proof. This is very similar to Lemma 14 in ^H]- If i — j, then using the fact that the 
reduced tree of Section 12.3. II is a monotype GW tree, the result is a well-known fact. To 
treat the general case we elaborate slightly on the proof. 
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Let r 6 Nbe fixed, and recall j = (j, j, . . .) and the notation at the very end of Section 
11.41 Let fi := ~pj be the offspring distribution of the GW tree U^(T) under and on 
some probability space (fl,A,P), let (W n , > 0) be a random walk with step distribution 
/}(• + 1) on {-1, 0, 1, 2, . . .}. Under pJ, the forest U^(F) is a monotype GW forest with 
offspring distribution jj, and r tree components. It is then well-known that 

pj(#n^(P) =n)= pj(#P (j) = n) = ^P(W n = -r) . 

By the local limit theorem in the lattice case [SJ Theorem XV.5.3], n 1 ^ 2 P(W n = — r) — > C 
as n — > oo for some C > 0, and for every r such that the probabilities under consideration 
are > 0. Moreover, there is a common uniform bound for all the terms as r varies along 
the admissible values. 

Let p(r;i,j) be the probability that there are r tree components in IL^(T) under 
PW. Notice that the probability distribution (p(r;i, j),r £ N) has finite expectation (its 
generating function is Tp^ with the notations of Section l2.3.1|) . so that Y2 r r P( r '-> hj) < °°- 
Then 

P«( #T (i) =n) = J2p(r; i, j)pJ(#nW(P) = n) , 

r>l 

and an application of the previous paragraph and dominated convergence (using the fact 
that n 3 / 2 pj($TL J ) (P) = n) is uniformly bounded) gives that 

n Wp®(#T® = n) -> Cj2 r P(r;i,j) , (17) 

which is the wanted result. □ 

Lemma 7 The respective laws of the number of tree components of US') (T) under the 
probability distributions PW(-|#T^ = n) converge weakly as n — > oo. 

Proof. We use the notations of the previous proof, as well as the expression ()17j) of the 
constant Cjj. Observe that the P^-probability that Yi^\T) has r components given it 
has n individuals is 

f*^w = r i# r<,) = ») = "'^y t - - v rp 't- j) ^ • 

and this does define a probability distribution. □ 
The following modification of Proposition El for forests with a fixed number of trees 
also holds: 

Lemma 8 In the case K = 1, assume (H) and take r £ N. Lei P r := P* &e £/ie /aw 
o/ a monotype GW forest with r tree components and offspring distribution fi := fx^. 
Then the process (n _1 / 2 P^ s j , < s < 1) under P r (-\#F = n) converges in distribution to 
2a~ 1 B cx as n — > oo. 
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Proof. The law of the total size of a GW tree is in the domain of attraction of a positive 
stable law with index 1/2, as follows from the previous lemma in the case K = 1, so 
that when taking r independent copies of GW trees with offspring distribution /x, and 
conditioning their sum to be n, only one of the trees has a size of order n, while the 
others r — 1 trees have o(n) size, hence have maximal height o(n 1//2 ) according to Lemma 
0] Hence the result. □ 



Proof of Theorem [2J The proof starts like the one of Theorem ^ For < s < 1, 
write 

rr(i) it\ 



a [#Ts} 



^Aj(L#Tsj) 



ajbj 



< 



RJs) 



(18) 



where \R n (s)\ < {ajbj) 1 (2 max < fc <„ \H k _ x - H i 



nW)(T) Tj-nW(T)i 



We start by showing the convergence of the processes (n ' if 



1) 

1/2 o-nO)(T) 



/aj-6j,0 < s < 1) 



AJ(L#T S J)/^ W J 

under P®(-\#TV) = n). Since n^(T) under pW is a GW forest, and by first conditioning 
on c n y)( T )(0) = r, we obtain using Lemmas [7| and |H] that (ri~ 1//2 P^j < ' T ' ) , < s < 1) 

under P^\-\^T^ = n) converges in distribution to 2o r J 1 B cx , where a 2 = Var (jlj), with 
definitions from Proposition HJ 

We now show that (n~ 1 Aj( \_H z Ts\ ), < s < 1) converges in probability to the identity 
on [0, 1], under the conditioned measures. Using Lemma El and (ii) in Proposition one 
obtains that for some C > 0, for every s G [0, 1], 



a7 l sn\ > n 1/2+1 \#F[ 



U) 



n 



< Cn^P' 1 (\Gj (LH) - a J lsn \ > n 1/2+ ~<) = oe(n) (19) 
Since Pi under P 1 has same distribution as T under Pw, we thus obtain that for < s < 1, 



pW(|Gj(LnsJ) - afsn] > n l / 2+ ^\^T^ = n) = oe(n) 



(20) 



which shows that n 1 G^([^sJ) under PW(-|#T^ 



j) 



n 



converges to a ■ s for every s G 



[0, 1] in probability, and thus, by the same reasoning as in Section I2.6| we obtain that 
(n _1 (jj(|_nsj), < s < 1) converges in probability to (ei^s, < s < 1) for the uniform 
topology In particular, for s = 1 we obtain that n~ x j^T under P^\-\^T^ = n) converges 
to aj 1 in probability, where we recall that we adopted the convention Gj(n) = #T. 

Now, (n -1 Aj( |_#TsJ ), < s < 1) is the right-continuous inverse of (#T~ 1 Gj( [ns\ ), < 
s < 1), and as such, it converges to the identity of [0, 1] in probability for the uniform 
topology. 

It remains to show that the two terms on the right-hand side of (|18|) are o(n 1//2 ) in 
probability, uniformly in s G [0,1]. First, notice that letting T J := c n o)( T )(0) be the 
number of tree components of H^(T), then the law of U^(T) under pW(.|T- 7 = r) is 
the same as that of II^(T) under PJ, i.e. is that of a monotype GW forest with r tree 
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components. Using Lemma |H1 one concludes that pW(sup 0<s<1 n 

n, T- 7 = r) converges to for any e > 0. Using Lemma® we know that the laws of T J 
under PW(-\#TV> = n) are tight as n varies, so that we get PW(sup 0<s<1 n _1//2 |P n (s)| > 
e\#T& = n) -> as well. 

Finally, by applying ptljl for ,s = 1. we obtain that pW(#T > An|#T^ = n) = oe(ra) 
for any A > aj 1 . Combining that with Proposition El gives for such A and some C > 0: 



p(») | 

0<fc<#T 



u(k) 



Anc u T (k \j) 



< Cn 3/2 P i [ max 

0<fc<An 



u(k) 



> n 3/8 |#T^ = n 
Ancf^j) 



> n 3/8 + oe(n) = oe(n) 



hence the result. 



□ 



Remark. In the companion paper ^3], a similar statement as Theorem H] was needed, 
with the law P^\-\j^T^ = n) replaced by p(w(-|^pw) = n), i.e. by a forest with two 
trees conditioned by their total number of vertices of type j. The proof of such a statement 
should be clear from the previous methods: by applying the transformation II ^ to this 
forest, one obtains a monotype Galton- Watson forest with a random number of roots that 
is tight as n varies, and conditioned to have n vertices. By conditioning on its number of 
roots and applying Lemma |H1 this implies that none but one of these trees have more that 
o(n) vertices. Hence in the initial forest with two components, only one of the components 
has more than o{n) vertices, and the result is now a consequence of Theorem HJ which 
will be proved in the next section. 



3 Proof of Theorem 01 

The key technical results needed to prove Theorem El are, like in ^Hj, a control on the 
frequencies of branching events in GW trees, which will allow to prove the convergence 
of finite-dimensional marginals of the snake, and a bound on a Holder norm-like quantity 
for the rescaled height process, which will imply the tightness. 

3.1 Exponential control of branching events 

For u e f G F^ K \ let AnCf (i, w, I, h) := AnCf (e(v) = i,Wf(v) =w,ue vlf v i, \v\ > \u\-h), 
i.e. the number of ancestors v of u with type i, with children's types w, such that u is 
the descendent of the Z-th child of v, and that are at distance at most h from u. In the 
sequel, when dealing with quantities of the form Anc^ (P„), and if the last argument is h, 
we will understand that we consider only those ancestors v of u such that \v\ > \u\ — h. 
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Lemma 9 Assume \l satisfies (H) and has small exponential moments. Then for every 
x G [Kf and 7 > 0, 

/ \Anc£ k \i,w,l,h) - fra^C (i) ( w )l - 
P max max —7— > 1 | = oe[n) 

\ 0<A<nnT<h<|«(fc)|,weWjc,l<J<|w| h L l z+ ^ 

We first state an intermediate lemma. Recall the construction of the size-biased infinite 
tree t of Sect. 12. 1\ and the spinal path = Vo, V\, . . .. We assume t to be constructed on 
some probability space (Q,A,P). If u G t, we let wtfu) G Wk be the ordered sequence 
of its children's types, as for finite trees. 

Lemma 10 (i) The sequence ((e(V n ), e(V n+ i)), n > 0) is a Markov chain with step tran- 
sition P(i,j),(i' ,j>) = 3ji'bj>mjji /bj and equilibrium measure nuj) = dibjiriij. 
(ii) Conditionally on (e(Vk),k > 0), the variables (w^(V n ), V n+ i(n + l),n > 0) are inde- 
pendent, (here V n+ i{n + 1) is the last letter ofV n+ i, so that V n+ \ = V n V n+ i{n + 1)), with 
law defined by 

P ((w-t(K), Vn+i(n + 1)) = (w, /) I (e(V k ), k > 0)) 



m e (V n )e(V n+1 ) 

for every w G Wk and 1 < I < |w| such that W[ = e(V n+ i). 

Proof. The first statement is immediate to check since we already know that (e(V n ),n > 
0) is Markov with step transition pjj> = bj>rrijj>/bj. The conditional independence of 
W|(Kj), V n+ i(n + 1), n > is easy from the construction of t, and we have 

p<< (\i \ x/ ( +u\ < m A/n>n^ p (( w t(K), Wn + 1)) = (w,Q|e(V n )) 

p w t (V n ,K+i = (w,0 \e(V k ),k> = — — - — r — — , 

P(e(V n+ i) = wi\e(V n )) 

which amounts to the desired result. □ 
Proof of Lemma ® Fix 7 > and choose < rj < 7 2 . First, we claim that 



P x I max cp(u(k)) > riP \ = oe(n) . 

\0<k<n J 

Indeed, under P x , the sequences (wf(w^(fc)),ij > 0), for % G [K], are independent i.i.d. 
sequences with respective common distribution as follows from the Markov branch- 
ing property of jH] and the fact that, when exploring the forest in depth-first order, no 
information on the set of children of the vertex explored at step n can be obtained before 
this step. Hence, the fact that /x admits small exponential moments gives that 

P x ( max max c F (u^(k)) > n v ) < Kn max( w (|w| > n v ) 
\ie[K]0<k<n J ie[K] ' 



ie[K] 
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which by (jSJ) and Markov's inequality is an oe(n). Since 

max Cf(u(k)) < max max Cf(u^(k)) , 

0<fc<n ie[K) 0<k<n 

this gives the claim. Considering this, and using Lemma EJ we may restrict ourselves to 
showing that 

P, x , ( -ax max |AncM», w,f fe) - ^.^(w)! \ 

where 5„ )/v = {(fa, w,l) 6 Z + x VWk- x Z + : n 7 < fa < fa', 1 < I < |w| < n' 7 }. By using 
Lemma ^ in a similar way as in the proof of Proposition the probability in the left-hand 
side of (jUJ) is bounded above by 

r V V" pW.q Tnov |Anc T (z,w,/,fa) -faaA,,C W (w)l \ , . 

r=l h'=\n~<\ V > > v n,h / 

where C = maXj bjj mim, b r Writing <5i,w,«,ft f° r the quotient appearing in the probability, 
we have 

P (Xr) ' h ' ( max Q i>wM >l)< n^+^K^ max P>"^' (Q i>wM > 1) , (23) 

where n 1//2+7+T? _ft' n ' ? bounds the cardinality of B n h i with fa' < n 1 / 2 " 1-7 . Let Anc"(z,j, fa) = 
Anc"(e(u) = i, e(vu\ v \ + i) = j, \v\ > \u\ — fa) be the number of strict ancestors v of u G t G 
such that e(v) = i, \v\ > \u\ — h and e(v') = j whenever v' is the child of v with 
v h v' h it, so that according to (ii) in Lemma ITUI under pMfi and given Anc^(z, j, fa) = 
fc, the random variable Anc^ (z, w, /, fa) is binomial with parameters (k,^(w)/rriij), for 
every w, / with wi = j. 

By Lemmas fT0|. (i) and0 we have for every n 1 < h < fa', 

max P^'flAnc^z, j, fa) - ha^m^ > fa 1/2+7 ) < exp(-fa 7 ) 
ij'e[K] 

< exp(-n 72 ), (24) 

for every large enough n. In particular, for some c > 0, infj jefK] Anc^(z, j, fa) > cfa with 
P^ r ^ ,?i '-probability < exp(— n^ 2 ) for every n< < fa < fa' and large enough n. Moreover, 

p (av) '"'(g,,wM> i) 

fa 1 /2+7 



< p(x r ),h' 



a y / • t n C W (w)Anc^(2,^,fa) 
Ancf (z, w, /, fa) 



my 



> 



+ pM,h' ( \Anc%(i, Wl ,h) - ha^rrii^W/mij > 



2 

faV2+7 
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Since C^(w) — ano - using (ED); the second probability is < exp(— n' y2 ) for large n, as 
long as (h, w, I) G B n ^ for hi > n 7 . 

Also, according to Hoeffding's inequality for binomial distributions, it holds that 

> i | Anc^(i,i, h) = jfe) 
< 2exp(-2/i 1+2 7£;), 

which for h > rf and k < ch is less than 2 exp (-2n 27 /c). This is enough to conclude 
that for every n 7 < h < h' < n 1 / 2+7 and large enough n, 

max P^'iQi^h > 1) < exp(-n 72 ) . 

(h,vf,l)&B nh i 

Combining with (J2*3*j) and then ()22|) . gives the result (notice that by our choice of rj, we 
have 7 2 > 77 so that the quantity on the right-hand side of (J2*3j) is an oe(n)). □ 

We also have the following controls 

Lemma 11 Under the same hypotheses as Lemma\^ 

(i) For every T, r] > there exists C > so that for every large n, 

P x I max c F (u(k)) > Clogn ) < n~ r . 

\0<k<n J 

(ii) For every c > there exists 7 G (0, 1) so that 

^Anc u F {k) 

(iii) For every M > 0, i/iere a constant Cm > sitc/j that 

P x ( max max sup m M h~ l A\ic u J; k \cF{v) =m,h)> Cm I = op 

\0<k<nnv<h<\u(k)\ m >i / 

i/ere x„ = op(n) means that for every T > 0, x n < n~ T for every large n. 

Proof. Point (i) follows similar lines as the beginning of the proof of Lemma El We 
bound the probability of interest by 

ifn max «^(|z|i > Clogn) < Kn v ~ Ce max «^(exp(£:|z|i)) 

for some appropriate e > 0, and choose C so that 1 — Ce < —T. 

For (ii), we see that, using Lemma EJ it suffices to bound the probability 

-P| X i/2+„i I max max /i~ 7 Ancp(cp(f ) > clogn, h) > 1 I . 

Vn J \ueF,\u\<n^/2+ini<h<\u\ W / 



P x I max max h 7 Anc P (cpiv) > clogn, h) > 1 = oe(n) . 

1 0<fc<nni<h<|u(fc)| 6 ' '- 1 



n) . 
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Using Lemma Q this is bounded up to some fixed multiplicative constant by 

max ,P {Xr) ' h ' {Anc^{c T {v) > clogn, h) > K 1 ) . 

r=l h^ln^l ~ ~ 

By conditioning on the types of the ancestors of V under p( XrS >' h \ we may again use (ii) 
in Lemma fTUl to argue that the number of ancestors v of V such that Cf{v) > clogn and 
|u| > |V| — /i is a sum of independent Bernoulli random variables with parameters of the 
form 

C (i) (|w| > clogn) < ^_ C£ maxig [y] C W (exp(g|w|)) 

m ij mm ij-m ij7 L m ij 

so that it is stochastically bounded by a sum of h i.i.d. Bernoulli random variables 
with parameter ft, _C£ /( 1 / 2 +'?) < h~ C£ as soon as < n < 1/2. Now if Xi, . . . ,X n are 
i.i.d. Bernoulli random variables with common parameter p on some probability space 
(Q,A,P), P(J2i=i X i > x ) < exp(-Ax + np(exp(A) - 1)) for every A > 0, by Markov's 
inequality. Therefore, by choosing A small enough, for some C > 0, and by unconditioning 
on the sequence of types in the end, 

p( Xr ),h' (Anc£(c T (tO > clogn, h) > 2h l ~ C£ ) < ex^-Ch 1 ' 06 ) , 

hence the result by choosing 7 G (1 — ce, 1), provided n v < h and n is large enough. 

Point (iii) is obtained following similar lines. We are even going to prove the result 
with Anc^ (cp(v) = m,h) replaced by Anc^ (cp(t>) = m,h). Fix T > 0, then the 
probability under consideration is bounded up to a multiplicative constant by 



J L" J / \ 

V V P {Xr) ' h ' I max max m M h' x kncl(c T (v) > m.h) > C M ) 

^— ' ^— ' \Km<ClognnK/i</i' / 

r=l h'=\n*\ \ - - b / 



+ o(n" r ) , 



for some C > 0, where we used point (i) of the lemma to bound the maximal degree of a 
vertex in the tree, and then Lemma H The first term is in turn bounded by 



""' max max max P^' h '(m M h 1 Anc^ (c T (v) > m, h) > C M m M h) 



,1+277 . 

i€[K] n 1 )</i</i'<n 1 /2+»? l<m<Clogn 



Using again the large deviation inequality as above, we obtain that the probability on the 
right hand-side is bounded by exp(— A/i(Cm^~ m — C"exp(— £m))) for some A > and 
where C, £ are such that 

max C w (|w| > x) < 2- 1 C"exp(-^x). 

ie[K] 

By choosing Cm large enough, we obtain that 

exp(-\h(C M m- M - C"exp(-£m))) < exp(-A'/im- M ) 

for some A' > and every m > 1. Since we consider only the terms m < Clogn and 
h > rf, we get that hm~ M > ri n (C\ogn)~ M which is > n v ^ 2 for large n, and this allows 
to conclude. □ 
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3.2 Finite-dimensional marginals 

We are now able to show the convergence of finite-dimensional marginals for the spatial 
process S. Notice the slightly loosened hypotheses when compared with Theorem El 

Proposition 7 Assume that £ satisfies (H) and admits small exponential moments. Also 
assume that the spatial displacement laws v iV{ are non-degenerate, centered and have a 
finite variance (u i{W , satisfying 

max<i/ ijWj Ij/H) =0(\w\ d ), (25) 

ie[K] 

for some D > 0. Then, for any x £ [K] n , jointly with the convergence (i) in Theorem^ 
for any < s 1 < s 2 < . . . < Sk, it holds that under P x , 

(^,i<i<*),^(^,i<i<*), 

where R has the law described in Theorem [3J If x — i, it also holds jointly with (ii) in 
Theorem^ 

Proof. Since the proof is very similar to that of fHl Proposition 28] and the general 
case does not involve more sophisticated tools besides notational annoyments, we give the 
detailed argument only for k — 2. 

Making use of Skorokhod's representation theorem, we will assume that the discrete 
snakes we are considering are defined on a probability space (Q, A, P) so that following 
holds. First, this space supports a sequence of processes H n , A", i £ [K] with same distri- 
bution as n~ x l 2 Hy n ^ , Af ( [n-\ ), % £ [K] under P x , such that H n converges almost-surely 
to a process B which is 2cr~ 1 times a standard Brownian motion, while A"/a, converges 
almost-surely to the identity function. Otherwise said, the convergence of Theorem Q is 
almost-sure. Second, for every n > 1, the processes (n 1 / 2 /^, , Af(k/n), i £ [K],k > 0) 
are the height and type-counting processes of a unique multitype random forest (F n , e n ) £ 
!F^ K \ and we assume that the discrete snakes are defined by using a family of random 
variables (Y™,u £ F n ) that are supported on (Q,A,P). That is, given F n , the vectors 
(^3)1 < l < cpn(u)),u £ F n are independent with respective distributions y e n {u),-w F n{u)i 
and we let 

S n k = Yl F "> k>0 

vhupn (k) 

be the associated discrete snake. 

Let s < t be given. For simplicity write u = upn( [ns\ ), u' = UF^({nt\) and u the most 
recent common ancestor of u, u'. The vector (S™ s , S™ t ) is the image by the application 
(x, y , z) i— > (x + z , y + z) of the vector (L™, R™, C n ) where 

l? = R i= E c» = £>?. 
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We aim at proving that given B, n" 1 / 4 (Ly, RJ, C n ) converges in distribution to a triple 
of independent Gaussian variables with respective variances E 2 B S , T, 2 B t , Y?B s ^ t , where 
B s ,t = inf s < u <t-Bu- 

First note that conditionally on F n , L", R", C n are almost independent as they are 
sums involving terms which are all independent but for one: if I, V are such that u G ulF? 
and u' G ul'F^,, i.e. tt and u' are in the Z-th and /'-th subtree pending from u, then 
and Y? v may be dependent. However, letting L" = Y£ + and R" = Y^/ + R£, the 
vector (Lg, R£, C n ) has three independent components. It is thus sufficient to prove that 
YP[ and n^^YP both converge to as n — > oo, and to show the individual 
convergences of n _1//4 L n , n~ 1,/4 R n , n _1 / 4 C n to three Gaussian variables with the correct 
variances, where L n and R n are defined as L™ and R™, but allowing the term v = u in the 
sum. 

Using Lemma E] for T > 1 and applying the Borel-Cantelli Lemma, we have that 
almost-surely, for n large enough, cp«(n(A;)) < Clogn for every k < nt and for some C > 
0. Conditionally on F n and on this event, the expectation of max^(Y^) 2 is bounded above 
by (Clogn) D by our assumption on moments, from which it follows that n~ 1//4 maxzY^ 
does converge to 0. The reasoning is similar for estimating n~ l ^Y£. 

Let us now deal with L n , the case of R n being similar. The sum defining L n is a sum 
along a part of the ancestral line of u, whose height is approximately ^Jn{B s — B S)t )- Note 
that a.s., B s — B s t > by standard properties of Brownian motion, so that the sum in 
question has an order of \fn terms. Write 

L™ = ^ Y V 1 {c F n(u)<c} + ^ Y V 1 {c F r l {u)> C } ■ 

v!rv\-u u"rv\-u 

We call these two sums and L™. 
We first argue that 

lim limsupP(r^ 1/4 |L™| > e\B) = 0. (26) 

C >OC n — > oc 

Letting h — \u\ — \u\, we noticed that given B, h is of the order n 1//2 . In particular, using 
the Borel-Cantelli Lemma and (iii) in Lemma ITT1 for M = D + 2, for some constant Cd, 
and every m > 1, a.s. Anc^„(cpn(t>) = m,h) < CDm~ D ~ 2 h, for every large enough n. 
Then by Chebychev's Inequality and conditional independence of the terms, 

P{n- 1 l A \l n c \>e\B,F n ) < n^e" 2 Yl Anc M*, w, I, h)(u i>vr , yf) 

ie[K] w&W K ,\w\>c l<i<|w| 

< Cn~ x l 2 e~ 2 m D Anc u Fn {c F n (v) = m, h) 

m>c 

< C D e~ 2 n- l ' 2 hY,m-\ 

m>c 

at least for (w-dependent) large enough n. Notice that this upper bound does not depend 
on F n . Since n~ 1 ^ 2 h converges to B s — B s t , taking the conditional expectation given B 



3 PROOF OF THEOREMS 



35 



and applying the reverse Fatou Lemma, we obtain 

limsupP(n- 1/4 |L™| > e\B) < £[limsup P((n" 1/4 |L™| > e\B, F n )\B] 

n— >oo n— >oo 



m 2 



m>c 

which goes to as c — > oo. 

On the other hand, fixing rj, e > 0, we may apply Lemma 01 and Borel-Cantelli to 
obtain that a.s., for large enough n, and every 0<k<nt,n v <h< \u(k)\,w € Wk with 
|w| < c and 1 < I < |w|, 

|An4 ( n fc) (i,w,Z,/i) - h ai b Wl (^(w)\ < h 1 ' 2 ^ < e/ia^C (i) (w) . (27) 
Now, we can redisplay L™ as the finite sum 

w 

E S Y v ll {e{v)=i^ Fn {v)=^,u(ivlF^} , 

i£[K]weW K ,\w\<c 1=1 uhvhu 

and given F n , the last summation has Anc^„(z, w, /, h) i.i.d. terms (for h = \u\ — \u\) with 
variance (^ jW ,y 2 ). Using (|2*7j) . we can apply the the central limit theorem conditionally 
on B,F n , and obtain that ri~ 1//4 L" converges in distribution to a Gaussian variable with 
variance (B s — B Stt )Y? c where 

|w| 

E '=E^ E c w (w)^^Kw,^ 2 ). 

ie[if] wGWk,|w|<c 1=1 

As c — ^ oo, this converges to the constant E 2 of the statement of the theorem, which 
implies the result when combined with (|2fi|). 

To complete the proof, it remains to show the convergence of n -1 / 4 C n . The argument is 
essentially the same, where the height h under consideration is of the order y/nB s t rather 
than y/n(B s — B st ). The argument above is unchanged in the case B s t > 0, but is not 
valid anymore if B S)t = 0. In that case, we claim that the result is in fact trivial as we have 
\u\ = 0, so that C™ = 0. Indeed, assume for a moment that x = i for some i £ [K\. By 
Theorem Q and plugging a new random element in our use of Skorokhod's representation 
Theorem, we may assume that (n-^Yj^j, s > 0) converges a.s. to a multiple of the local 

time L° of B at level 0. Since B s t = and B s B t > 0, there is an increase time of L° 
between times s and t. Therefore, the function T F " increases as well between these times, 
at least for large enough n. This means that u and u' are in two different tree components 
of F n , and thus u — 0. 

The case where x is any element of [K] N is a slight elaboration of the preceding 
argument, which we briefly sketch. If B s t = but \u\ > for infinitely many n, this 
means that for these values of n, the vertices u, u' belong to the same tree component 
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1/9 IT* 1 '' ( F n ) 

of F n , whose root is of type i, say. Now, n~ ' H,, converges to a multiple of the 

,„ ( F n ') 

Brownian motion B./ a . by the proof of Theorem [U while n ~ ' T, , converges to a 
multiple of the local time of the latter. Necessarily, there is a time of increase of this local 
time between the times a^s and Ojt, hence the function n _1 / 2 Tj^ ■* increases also during 
that time interval, whose length corresponds asymptotically to the fraction of vertices of 
type % that appear between u and u' in depth-first order. This is a contradiction with the 
fact that u and u' belong to the same tree component of F n . □ 



3.3 Holder norm bounds 

Proposition 8 Let /x satisfy the basic assumption (H), and admit small exponential mo- 
ments. For every A > and a £ (0, 1/4), for every e > 0, there exists C > such that 
for every x £ [-^] N , 

SUp \ H ns - H nt\ > c | ^ g _ 



nSN \ s^te[0,A],ns,ntgZ4 



Remark. As will be shown below, in the particular case K = 1, we are able prove the 
same assertion with a < 1/2 rather than a < 1/4. In [IH] . we could also obtain the result 
for a < 1/2 because of the particular nature of the multitype trees we were considering, 
i.e. alternating types, corresponding to an antidiagonal mean matrix. In the general case, 
our method of approximation by the monotype case does not seem to be fine enough to 
obtain the best estimate. 

Proof. We first prove the result in the case K = 1, and let /i := P := P 1 . Our proof 
is partly inspired from that of jU Theorem 1.4.4], of which we can interpret Proposition 
[HI to be a discrete counterpart. 

Recall e.g. from [3] that if we let 

/? 

V£ = X)(cf(u(Ar)) - 1) , n > 

k=l 

be the Lukaciewicz walk associated with the forest f , then the height process of f is given 
by 

Hi = # \k £ {0, 1, . . . , n - 1} : V* = min vA . (28) 

k<l<n 

Under P, V F is a random walk on Z with centered step distribution /i(-+l) on { — 1, 0, 1,2,.. 

Now, suppose < s < t < A are such that ns,nt £ Z + . Write X(x) = max{Z £ 
[0, ns] : V t F < x}. Using (f2~Sj) . we have 



Tit _ Tit 



#lke [ns, nt) : V k F = min V, F \ 

I k<l<nt 1 J 



-# A mm V l \ < k < ns :V l = min Vf , (29) 

, ns<l<nt I k<Kns 
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and the rest of the proof will consist in estimating the moments of the two terms above, 
which correspond to the lengths of the branches of F from u(ns),u(nt) down to their 
most recent common ancestor. By the time reversal property for walks, 

(Vf } = V n F - Vl_ k ,0 < k<n)± (Vf ,0 < k < n), 

the first term in is equal in distribution, under P, to 

W n(t _ s) := # (l < k < n(t - s) : V k F = max vA , 

the number of (weak) records of V F before time n(t — s). 

Let M n = max <fc< n V F . Let r = 0, and Tj, i > 1 be the z-th record time, i.e. the z-th 
time r > 1 such that V F = M T . Then it is easy and well-known that (rj — t^-i, i > 1) are a 
sequence of i.i.d. random variables. Moreover, since is centered and its increments have 
finite second moment under P, it is a consequence of the proof of [HI XII, 7 Theorem la] 
and the discussion before that the Laplace exponent 0(s) = — log-E'[exp(— sti)] ~ Cs 1 ^ 2 
as s — > for some C > (Feller considers the case of strict ladder epochs, but the 
treatment of weak ones is similar). Now, for any p > 1, and integer u, 



POO 

E[W*] = p x p ~ l P (W u > x) dx 
Jo 

/oo I x \ 

x p ~ l P ( J2(r t - ri_i) < uj dx 



V 



fOO 

< pe I x p ^E 
o 



exp 




dx < c^u^yp < c"u p/2 , 



for some C, C" > and every u. Since ns, nt are distinct integers, we showed that 
E\Wn(t-s){ — Cin p l 2 \s — t\ p / 2 uniformly in such n, s, t, where C\ = Ci(p,p) > 0. 

Let us now handle the second term in (j2HJ). Using time- reversal, we see that this equals 

# <n(t - s) < k < nt A k ( max V, F | :VF = max V, 1 

{ \l<l<n(t-s) J n(t-s)<l<k 

in distribution, where k(x) = min{A; > n(t — s) : V F > x} (with the convention 
min0 = oo). By using Markov's property at time n(t — s), this has same distribution as 
W nsAK 0y[ ^ where W is defined as above, while V is an independent copy of 

V F with maximum process M. By monotonicity this is less than ^ )-v { )' "^ e ^ 

us prove that E[W P , J < Cx p for every x > 0, for some C > 0. To this end, notice that 



W„(t- S ) 



M n (t- S ) = (Kf-KL), (30) 



i=l 



3 PROOF OF THEOREMS 



38 



and it is a classical result of fluctuation theory that the variables Vj P — Vf are inde- 

J r i T i—1 

pendent with common distribution P(V£ = i) = fi([i + l,oo)),z > 0, so their mean is 
cr 2 /2, where a 2 is the variance of /i, and notice that these variables have small exponential 
moments. Now, the usual large deviations theorem shows that for some a, N > and for 
every n > N, 

— \ ^ < \ ) < exp(-an). (31) 
Now, using (J50j) in the second equality, 



POO 

E IK [X) ] = V \ u^ l P{W K{x) > u)du 

J 

,00 / M \ 

= p I uP ~ lp ( E« - kL) < x I du 



roo ( M 

^nD<-C)<7i^ 



< Px p I r t^Mt; + jf^ v^P ( g(V r f - V£j < ^ I dt; 



Now, as soon as x is large enough, i.e. 4xa 2 > N, where N is defined before f!31|) . the 
probability in the second integral is bounded by exp(— axv ) < exp(— v) if we further ask 
x > a" 1 . Thus the wanted bound on E[W P ^}. By the independence of V and V F , we 
conclude that 

^[(^(M„ (t _ s) -K l(i _ s)) ) P ] < CE[(M n[t ^-V n %_ s) f] (32) 



n(t-s)J J' 



where we used Doob's inequality E[M p , t _s] < (p/(p— ^)) p E[(V^ t _ s ^) p ], since ^ F is cen- 
tered. Now we use the following consequence of Rosenthal's inequality |TH1 Theorem 2.10]: 
if Xi, . . . ,X n are independent centered random variables (not necessarily identically dis- 
tributed) defined on some probability space (Cl,A,P), then for every p > 2 there exists 
C(p) such that 

n 

E[\X X + ...+ X n \ p ] < C(p)rfl 2 ~ l ]T E[\Xi\ p }. (33) 

i=l 

This shows that E[(V F [t _ s) ) p ] < C'{p)n p l 2 \s - t\ p l 2 for some C'ijp) > 0, for every s,t such 
that ns,nt G Z + , and therefore the same kind of upper bound holds for the quantity in 

432). 
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Putting things together, we have obtained that for every p > 2 and some Ci 
C 2 (fi,p) > 0, 



sup sup E 

ri>l s,t>0,ns,nt£Z+ 



— 11, 



nt 



n 



< Co\s-t\ p/2 . 



Let now H[ s \, s > be defined by linear interpolation between linear abscissa. Then, it 
is elementary that 



sup sup E 

n>l sj^t>0 



ttF _ ttF 

n {ns} n {nt} 



in 



<C*\s-t\ p/2 



for some C3 > 0. The uniform estimate in Kolmogorov's criterion ^Hl Theorem 3.4.16] 
finally entails the result, for K — 1. 

The general case is obtained by using the contraction function n^. We have, for 
< s, t < A and ns, nt e Z+, 



I u F — n F I <r 

\ rL ns -"nil — 



]n nW(F) _ „n«(P) 1 

I^Af^-l Af(rat)-ll 



2 max 

0<fc<An 



a Af(fc)-l 



ai&i 



On the one hand, by the case K = 1 and Proposition HJ we have with high P x -probability, 
uniformly in n G N, 

m-nWrF) n( J )(F) , 



since A F is a counting function. On the other hand, using the same inequalities as around 



max 

0<fc<An 



H, 



F Af(fc)-1 



< max 

0<k<An 



Anci?(l, u(k)) 



max 

0<fc<An 



\HtT-Hf HF) \ + l 



According to Proposition the first term on the right-hand side is bounded above by 
n l / A+1 with high probability for large n, where we chose < 7 < 1/4 — a, so that 
n i/4+7 <- ^jYt\s — t\ a for every large n (recall that ns, nt E Z + so that \s — 1\ > n' 1 ). As 
for the second term, by [T3*| Lemma 21] (while the statement is on conditioned trees, the 
first part of its proof yields the result on forests), it holds that it is o(n 7 ) in P x -probability 
for any 7 > 0, which gives the wanted bound. □ 
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3.4 Tightness 

This section is devoted to the proof of last building block needed to prove Theorem 01 
namely 

Proposition 9 Under the hypotheses of Theorem^ for every x G [K] N , the laws of the 
processes {{ n ~ l ^S F ns ^ , s > 0), n > 1) under P x are tight in the Skorokhod space D(R+, R) . 

Proof. In this proof, C*i, . . . , Ci will be denoting strictly positive constants. Our first 
task is to obtain an upper bound for expectations of the form E x [|S,f f — 5*,fJ p ]. To this end, 
we first choose £, D so that the moment condition (j3J) holds, and write p = 8 + £. Also, 
fix < a < 1/4 so that a(8 + £) > 2, < r] < 1/4, and c > such that c < 7//(4 log-fT). 
According to Proposition 13.31 we may choose C\ > so that if 

{I H F — H F I 
max ^ ^ < d 
0<s^t<A:ns,nteZ + n i 'i 2 \s — t\ a 

then P x (/l n ) > 1 — e/2 for every n > 1. We let £> n be the intersection of A n with the 
three events {max < fe <^ n cp{u{k)) < Clogn}, 

|Ancp (fc) (z, w, I, h) - ha i b Wl ( ( - i \w) l 
max max max max -7- < 1 

0<k<An nv<h<\u(k)\ wGWk,1<2<|w| ie[K] h 1 ' 2 ^ 



and 



{max max Anc6 (cf(v ) > c\ogn,h) < h 1 > 
0<k<An n v<h<\u{k)\ J 



where C>0,0<7<1 are chosen so that P x (B n ) > 1 — e for every n sufficiently large, 
which is possible according to Lemmas H3 and ^2 We take n G N, choose < s 7^ t < A 
be such that k = ns G Z + ,&' = nt G Z + , and write tt = u(k),u' = u(k'). Then, by 
definition, we have 

^k ~ si' = 2^ ^ v ~~ ^ ' 

uhu,|«|>|fi| tihu',|i;|>|n| 

whenever u is the most recent common ancestor to u and u' . Assume u = urw, u' = ur'w' 
for some r 7^ r' G N and w,w' GU. This allows to redisplay the previous expression as 

&k ~ Sk> = (Ytir — Yur') 

^ ^ ^ ^ ^ ^ ^ ^uzl{|'u|>|fi|,wj?(u)='vv,v2l _ u} 

weWx l<Z<|w| ie[A"] wl-UjV^ve^W 

— ^ ^ ^ ^ yvl^-{\v\>\u\,w F (v)=v,,vlhu'} ■ 

weW x l<K|w| i6[K] vhu' ,Vytu' ,veF(i) 

By construction, under E x , all the terms of this sum are independent of each other condi- 
tionally on F, except possibly for Y& r and Y^ r i. Let h — \u\ — \u\ — 1 and h! = \u'\ — \u\ — 1, 
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and let R(k, k') = \u\ + \u'\ — 2\u\ = h + h! + 2 be the number of random variables of the 
form Y v that are involved in the expression S£ — S#. Using (jHHj) for p = 8 + £ gives, 



V?[\S£-S£\ P \F] 
< C 2 R(k, k'Y' 2 - 1 x 



(34) 



E x [|n r - Y^\p\F] 



< c z R{k,k'yi 2 ~ l 



c F (u) D 



X 



+ Em>i ™ D Ewewx,|w|=m E™i E 4G [x] Anc£(z, w, Z, Zi) 
+ E m >i m ° EweWK,|w|=m E™i E 4eM Anc£(i, w, Z, zo 



Now, on the event B n , we have cf(u) d < (Clogra) D < n 1 / 2 ^ — t\ a , since |s — 1\ > 1/n. 

It remains to bound the two above sums, by symmetry it suffices to deal with the first 
one. In the case where h <n v and on the event B n , notice that 

m 

^ ^ ^ Ancp(i, w, Z, h) = ^ m D Anc F (c F (v) = m,h) , 

m>l weVVjr,|w|=m 2=1 ie[if] l<m<Clogn 

which is less than (C log n) D h < (C logn) ^, and this in turn is less than n l l 2 \s — t\ a > 
n i/2-a since ^ < 1/4 < 1/2 - a. 

Assume now that Zi > n 77 . Still on B n , it holds that Anc^(i, w, Z, Zi) < haib Wl Q 1 ' (w) + 
Zi 1 / 2 ^. We split the sum under consideration into 

mD ^ ^ Anc^(z, w, Z, Zi) + ^ m D Anc F (c F (v) = m,h) 

l<m<clogn weWjr,|w|=m i=l ie[Jf] clog n<m<C log n 

The second term is bounded by (C logn) £) Zi 7 by definition of _B n and since Zi > n 77 , hence 
by Cnjn. The first term is bounded above by 

CsZimax V |w| D+1 C W (w) + C 5 h 1/2+r <(C log n) D+1 #{w G W K ■ |w| < clogn}, 



and since #{w e W K ■ |w| < clogn} < K cl °g" 



n 



C log i<" 



< /jClOgX/g anc J ky Q^QJQg Q f 



c, the whole is bounded by C%h < C§R{k, k'). 

As in Proposition 27], we argue that the number R(k, k'), which equals |u| + \u'\ — 
2\u\ = h + h' + 2, satisfies R(k, k') < C- 7 n 1 ^ 2 \s — t\ a on B n , for all choices of n, s, t. Hence 
by inspection of all the cases discussed above, we have on B n , 



so that 



E x [|Sf - S£\ P \F} < C s (n^ 2 \t - s\ a f /2 

<C,\t-s\ ap/2 . 



E 



cF _ cF 

^ns nt 



11 



1/1 
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As in the previous proof, if we let (SV-,, s > 0) be the linearly interpolated version of 
(^jnsj' s — 0) between abscissa points of the form k/n,k > 0, then it is elementary that 
a similar bound holds up to taking a larger Cg, this time for all < s ^ t < A. Also, 
by our choice of a, we have ap/2 > 1. Hence, an application of Kolmogorov's criterion 
to { n ~ l ^ 4 S^ ns ^, s > 0) gives that for every A, e > 0, there exists C*io,/3 > such that for 
every n > 1, 



I cF _ OF I 

I {nt\ {ns}\ , 

P x max — Vtti \ir~ > C 1Q 

0<t^s<A n 1 / 4 \t - s\P - 



B n )<e, (35) 



and since P x (B n ) > 1 — e, we may as well forget the conditioning on B n . 

This is enough to conclude that the laws of the continuous processes {n^^Sf,, s > 
0) under P x form a tight sequence, and the result will follow from the fact that these 
processes are respectively uniformly close to ("- _1 ^ 4 5'|^ s j , s > 0) over compact intervals. 
This immediately comes from maxi^^^ |^5 fe J = o(n 1 / 4 ) in probability, which itself can 
be inferred from the fact that with high probability, no vertex u(k) with < k < An has 
more than Clogn children, and the moment control (JH). Hence the result. □ 



3.5 Conditioned results: Theorem HI 

Obtaining Theorem 0] from Theorem El now consists in reproducing faithfully the proofs of 
Propositions [Hand El so we only sketch the plan of the proof. The important results that 
are needed are generalizations to conditioned measures of Lemmas El and El where P x 
must be replaced by P^ l \-\^T^ = n). This is straightforward from the latter lemmas and 
Lemma El which we use as we did around (|19)l . This is enough to obtain the exact analog 
of Proposition for the conditioned probabilities P^\-\^-T^> = n) (it is even simpler as 
the issue encountered at the very end of the proof of that proposition disappears). 

The analog of the tightness statement (Proposition El) is then a consequence of the 
following version of Proposition El for conditioned measures. 

Proposition 10 Assume that \i satisfies (H) and admits small exponential moments. 
For every i,j G [K], for every a G (0, 1/4) and e > 0, there exists C > such that 



P p«( sup |g ^i J , H \*^ l >c 

neN \s^t€ [0,1], #Ts,#TtGZ + yf U\S — t\ a 



#T (J ' } = n \ <e. 



Proof. We rest on ^Hl Theorem 24], which is essentially the monotype result (K = 1), 
with the extra freedom that we consider the conditioned law P}{-\#F = n) of a forest with 
r components. Indeed, when applying the mapping to T under P^\-\^T^^ = n), 
one obtains such a conditioned forest with a random number of roots, although the laws 
of these random numbers form a tight sequence by Lemma Hence, up to conditioning, 
we can assume that this number of roots is fixed and apply the monotype result. The 
conclusion is then the exact analog of the last lines of the proof of Proposition El Details 
are left to the interested reader. □ 
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